Optical theorem

In physics, the optical theorem is a general law of wave scattering theory, which relates the forward scattering amplitude to the total cross section of the scatterer. It is usually written in the form

\sigma_\mathrm{tot}=\frac{4\pi}{k}~\mathrm{Im}\,f(0),

where f(0) is the scattering amplitude at zero angle, that is, the amplitude of the wave scattered to the center of a distant screen, and k is the wavenumber of the incident wave. Because the theorem is derived using only conservation of energy (in quantum mechanics, conservation of probability), it is widely applicable, and in quantum mechanics \sigma_\mathrm{tot} includes both elastic and inelastic scattering. The above form is for an incident plane wave; a more general form discovered by Werner Heisenberg can be written

\mathrm{Im}~f(\bold{\hat{k}}', \bold{\hat{k}})=\frac{k}{4\pi}\int f(\bold{\hat{k}}',\bold{\hat{k}}'')f(\bold{\hat{k}}'',\bold{\hat{k}})~d\bold{\hat{k}}''.

Notice that as a natural consequence of the optical theorem, an object that scatters any light at all ought to have a nonzero forward scattering amplitude, and so a bright central spot.
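
As a numerical sanity check of the plane-wave form of the theorem, the following sketch compares the angle-integrated cross section with (4\pi/k)~\mathrm{Im}~f(0). It assumes purely elastic scattering described by a standard partial-wave expansion with real phase shifts, which is not part of the text above; the wavenumber and phase-shift values are hypothetical and chosen only for illustration.

```python
import cmath
import math


def legendre(l_max, x):
    """Return [P_0(x), ..., P_l_max(x)] using Bonnet's recursion."""
    p = [1.0, x]
    for l in range(1, l_max):
        p.append(((2 * l + 1) * x * p[l] - l * p[l - 1]) / (l + 1))
    return p[: l_max + 1]


def amplitude(cos_theta, k, deltas):
    """Partial-wave amplitude f = (1/k) sum_l (2l+1) e^{i d_l} sin(d_l) P_l(cos theta)."""
    p = legendre(len(deltas) - 1, cos_theta)
    return sum(
        (2 * l + 1) * cmath.exp(1j * d) * math.sin(d) * p[l]
        for l, d in enumerate(deltas)
    ) / k


def total_cross_section(k, deltas, n=2000):
    """Integrate |f|^2 over solid angle with composite Simpson's rule in cos(theta)."""
    h = 2.0 / n  # n must be even for Simpson's rule
    total = 0.0
    for i in range(n + 1):
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * abs(amplitude(-1.0 + i * h, k, deltas)) ** 2
    return 2 * math.pi * total * h / 3


k = 1.5                          # hypothetical wavenumber
deltas = [0.7, -0.3, 0.1, 0.02]  # hypothetical phase shifts for l = 0..3

sigma_integrated = total_cross_section(k, deltas)
sigma_optical = 4 * math.pi / k * amplitude(1.0, k, deltas).imag
print(sigma_integrated, sigma_optical)
```

Under these assumptions the two printed numbers should agree to within the quadrature error, since the forward amplitude alone fixes the total cross section.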

History

The optical theorem was originally discovered independently by Sellmeier and Lord Rayleigh in 1871. Lord Rayleigh expressed the index of refraction in terms of the forward scattering amplitude as

n = 1+2\pi \frac{Nf(0)}{k^2},

where N is the number density of scatterers. He used this relation in a study of the color and polarization of the sky. The equation was later extended to quantum scattering theory by several individuals, and came to be known as the Bohr–Peierls–Placzek relation after a 1939 publication. It was first referred to as the "Optical Theorem" in print in 1955 by Hans Bethe and Frederic de Hoffmann, after it had been known as a "well known theorem of optics" for some time.

Derivation

The theorem can be derived rather directly from a treatment of a scalar wave. If a plane wave is incident on an object, then the wave amplitude a great distance away from the scatterer is given approximately by

\psi(\bold{r}) \approx e^{ikz}+f(\theta)\frac{e^{ikr}}{r}.

All higher terms, when squared, vanish more quickly than 1/r^2, and so are negligible a great distance away. Notice that for large values of z and small angles the binomial theorem gives us

r=\sqrt{x^2+y^2+z^2}\approx z+\frac{x^2+y^2}{2z}.
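
The quality of this small-angle approximation can be checked with a short sketch. The coordinates below are hypothetical values chosen so that z is much larger than x and y.

```python
import math


def exact_r(x, y, z):
    """Exact distance from the scatterer to the point (x, y, z)."""
    return math.sqrt(x * x + y * y + z * z)


def paraxial_r(x, y, z):
    """First-order binomial approximation, valid when z >> x, y."""
    return z + (x * x + y * y) / (2 * z)


x, y, z = 1.0, 2.0, 100.0  # hypothetical point on a distant screen
error = paraxial_r(x, y, z) - exact_r(x, y, z)
print(error)
```

The approximation slightly overestimates r, and the discrepancy shrinks rapidly as z grows relative to x and y.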

We would now like to use the fact that the intensity is proportional to the square of the amplitude \psi. Approximating the r in the denominator as z, we have

|\psi|^2=\left|e^{ikz}+\frac{f(\theta)}{z}e^{ikz}e^{ik(x^2+y^2)/2z}\right|^2
=1+\frac{f(\theta)}{z}e^{ik(x^2+y^2)/2z}+\frac{f^*(\theta)}{z}e^{-ik(x^2+y^2)/2z}+\frac{|f(\theta)|^2}{z^2}.

If we drop the 1/z^2 term and use the fact that A+A^*=2~\mathrm{Re}~A, we have

|\psi|^2\approx 1+2~\mathrm{Re}\left(\frac{f(\theta)}{z}e^{ik(x^2+y^2)/2z}\right).

Now suppose we integrate over a screen in the x-y plane, at a distance which is small enough for the small angle approximations to be appropriate, but large enough that we can integrate the intensity from -\infty to \infty with negligible error. In optics, this is equivalent to including many fringes of the diffraction pattern. To further simplify matters, let's approximate f(\theta)=f(0). We quickly obtain

\int |\psi|^2~da \approx A +2~\mathrm{Re}\left(\frac{f(0)}{z}\int_{-\infty}^{\infty} e^{ikx^2/2z}dx\int_{-\infty}^{\infty} e^{iky^2/2z}dy\right)

where A is the area of the surface integrated over. The exponentials can be treated as Gaussians, each integral evaluating to \sqrt{2\pi i z/k}, and so

\int |\psi|^2~da=A+2~\mathrm{Re}\left(\frac{f(0)}{z}\frac{i2z\pi}{k}\right)
=A-\frac{4\pi}{k}~\mathrm{Im}~f(0),

which is just the probability of reaching the screen if nothing were scattered, reduced by an amount (4\pi/k)~\mathrm{Im}~f(0), which is therefore the effective scattering cross section of the scatterer.
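
The Gaussian evaluation used in the last step can be checked numerically. The sketch below is only illustrative: it adds a small damping factor eps (an assumption, not part of the derivation) so that the oscillatory integral converges on a finite grid, and it uses hypothetical values for k, z and f(0).

```python
import cmath
import math


def damped_fresnel(a, eps, half_width=40.0, n=160000):
    """Trapezoid estimate of the integral of exp((i*a - eps) * x^2) over [-L, L]."""
    h = 2 * half_width / n
    total = 0j
    for i in range(n + 1):
        x = -half_width + i * h
        w = 0.5 if i in (0, n) else 1.0  # trapezoid end-point weights
        total += w * cmath.exp((1j * a - eps) * x * x)
    return total * h


k, z = 2.0, 1.0   # hypothetical wavenumber and screen distance
f0 = 0.3 + 0.8j   # hypothetical forward amplitude f(0)
eps = 0.01        # assumed small damping that regularizes the integral

one_dim = damped_fresnel(k / (2 * z), eps)  # x and y integrals are identical
interference = 2 * (f0 / z * one_dim * one_dim).real
predicted = -4 * math.pi / k * f0.imag
print(interference, predicted)
```

With a small eps the interference term should approach -(4\pi/k)~\mathrm{Im}~f(0), with a residual difference of order eps that comes from the damping.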

References